FLAT: Constructing a CLARIN Compatible Home for Language Resources

Authors

  • Menzo Windhouwer
  • Marc Kemps-Snijders
  • Paul Trilsbeek
  • André Moreira
  • Bas Van der Veen
  • Guilherme Silva
  • Daniel Von Reihn
Abstract

Language resources are valuable assets, both for institutions and researchers. To safeguard these resources, requirements for repository systems and data management have been specified by various branch organizations, e.g., CLARIN and the Data Seal of Approval. This paper describes these requirements, along with some additional ones posed by the authors' home institutions, and shows how FLAT meets them to provide a new home for language resources. FLAT is built on the Fedora Commons repository system. This repository system meets many of the requirements out of the box, but additional configuration and some development work are still needed to meet the remaining ones, e.g., to add support for Handles and Component Metadata. This paper describes the design decisions taken in constructing FLAT's system architecture via a mix-and-match strategy, with a preference for reusing existing solutions. FLAT is developed and used by the Meertens Institute and The Language Archive, but is also freely available to anyone in need of a CLARIN-compliant repository for their language resources.

Similar resources

Creating & Testing CLARIN Metadata Components

The CLARIN Metadata Infrastructure (CMDI) that is being developed in CLARIN (Common Language Resources and Technology Infrastructure) is a computer-supported framework that combines a flexible component approach with the explicit declaration of semantics. The goal of the Dutch CLARIN project “Creating & Testing CLARIN Metadata Components” is to create metadata components and profiles for a wide...

The CLARIN Research Infrastructure: Resources and Tools for eHumanities Scholars

CLARIN is the short name for the Common Language Resources and Technology Infrastructure, which aims at providing easy and sustainable access for scholars in the humanities and social sciences to digital language data and advanced tools to discover, explore, exploit, annotate, analyse or combine them, independent of where they are located. CLARIN is in the process of building a networked federa...

CLARIN and Free Open Source Finite-State Tools

CLARIN stands for Common Language Resources and Technologies Research Infrastructure and it is one of the 35 infrastructure projects listed in the ESFRI roadmap of European research infrastructures for various areas. CLARIN has now entered its 3-year preparatory phase under a grant from the EU Commission. The preparatory phase of CLARIN has 32 partner organizations (see www.clarin.eu for more ...

Resources, Tools, and Applications at the CLARIN Center Stuttgart

This NECTAR track paper (NECTAR: new scientific and technical advances in research) summarizes recent research and curation activities at the CLARIN center Stuttgart. CLARIN is a European initiative to advance research in humanities and social sciences by providing language-based resources via a shared distributed infrastructure. We provide an overview of the resources (i.e., corpora, lexical r...

CLARIN: Common Language Resources and Technology Infrastructure

This paper gives an overview of the CLARIN project [1], which aims to create a research infrastructure that makes language resources and technology (LRT) available and readily usable to scholars of all disciplines, in particular the humanities and social sciences (HSS).

Publication date: 2016